Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 506 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 63.4 KiB |
| Average record size in memory | 128.3 B |
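The average record size can be cross-checked from the other rows of this table. The sketch below assumes the figure is simply total memory divided by the number of observations; the variable names and the `percent` helper are illustrative, not part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_observations = 506
n_variables = 16
total_size_kib = 63.4      # reported total size in memory (rounded)
missing_cells = 0

# Assumed relationship: average record size = total bytes / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

def percent(part, whole):
    """Share of `part` in `whole`, as a percentage."""
    return 100.0 * part / whole

n_cells = n_observations * n_variables
print(f"{avg_record_bytes:.1f} B")                  # compare with the reported 128.3 B
print(f"{percent(missing_cells, n_cells):.1f}%")    # compare with "Missing cells (%)"
```

Because the total size is itself rounded to one decimal, the result only agrees with the report to about one decimal place.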
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 14 |
Alerts
| TOWN has a high cardinality: 92 distinct values | High cardinality |
|---|---|
| TRACT is highly overall correlated with MEDV and 9 other fields | High correlation |
| LON is highly overall correlated with TOWN | High correlation |
| LAT is highly overall correlated with TOWN | High correlation |
| MEDV is highly overall correlated with TRACT and 7 other fields | High correlation |
| CRIM is highly overall correlated with TRACT and 8 other fields | High correlation |
| ZN is highly overall correlated with CRIM and 5 other fields | High correlation |
| INDUS is highly overall correlated with TRACT and 8 other fields | High correlation |
| NOX is highly overall correlated with TRACT and 9 other fields | High correlation |
| RM is highly overall correlated with MEDV | High correlation |
| AGE is highly overall correlated with TRACT and 7 other fields | High correlation |
| DIS is highly overall correlated with TRACT and 7 other fields | High correlation |
| RAD is highly overall correlated with TRACT and 4 other fields | High correlation |
| TAX is highly overall correlated with TRACT and 8 other fields | High correlation |
| PTRATIO is highly overall correlated with TRACT and 2 other fields | High correlation |
| TOWN is highly overall correlated with TRACT and 9 other fields | High correlation |
| CHAS is highly imbalanced (63.7%) | Imbalance |
| TRACT has unique values | Unique |
| ZN has 372 (73.5%) zeros | Zeros |
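The 63.7% imbalance figure reported for CHAS is consistent with one minus the normalized Shannon entropy of the class shares. That formula is an assumption here, inferred from the numbers rather than stated in the report; the counts 471 and 35 come from the CHAS tables, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# CHAS: 471 rows with value 0 and 35 rows with value 1.
chas_score = imbalance_score([471, 35])
print(f"{chas_score:.1%}")   # compare with the 63.7% in the alert
```

Under this definition a perfectly balanced variable scores 0 and a variable dominated by one class approaches 1.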
Reproduction
| Analysis started | 2023-07-18 21:51:34.492263 |
|---|---|
| Analysis finished | 2023-07-18 21:52:43.107930 |
| Duration | 1 minute and 8.62 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
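The reported duration can be recomputed from the two timestamps above. This is a small sketch using only the standard library; the format string is an assumption that matches the timestamps as printed.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-07-18 21:51:34.492263", FMT)
finished = datetime.strptime("2023-07-18 21:52:43.107930", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} min {seconds:.2f} s")   # compare with "1 minute and 8.62 seconds"
```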
TOWN
Categorical
HIGH CARDINALITY  HIGH CORRELATION 
| Distinct | 92 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| Cambridge | 30 |
|---|---|
| Boston Savin Hill | 23 |
| Lynn | 22 |
| Boston Roxbury | 19 |
| Newton | 18 |
| Other values (87) | 394 |
Length
| Max length | 23 |
|---|---|
| Median length | 18 |
| Mean length | 9.9743083 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5047 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
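The mean length in the Length table agrees with the total character count divided by the number of rows. A minimal check, assuming that is how the mean is derived:

```python
total_characters = 5047   # "Total characters" for TOWN
n_observations = 506      # rows in the dataset

# Mean string length = total characters / number of values.
mean_length = total_characters / n_observations
print(round(mean_length, 7))   # compare with the reported mean length
```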
Unique
| Unique | 17 |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | Nahant |
|---|---|
| 2nd row | Swampscott |
| 3rd row | Swampscott |
| 4th row | Marblehead |
| 5th row | Marblehead |
Common Values
| Value | Count | Frequency (%) |
| Cambridge | 30 | 5.9% |
| Boston Savin Hill | 23 | 4.5% |
| Lynn | 22 | 4.3% |
| Boston Roxbury | 19 | 3.8% |
| Newton | 18 | 3.6% |
| Somerville | 15 | 3.0% |
| Boston South Boston | 13 | 2.6% |
| Quincy | 12 | 2.4% |
| Brookline | 12 | 2.4% |
| Boston East Boston | 12 | 2.4% |
| Other values (82) | 330 | 65.2% |
Words
| Value | Count | Frequency (%) |
| boston | 157 | 22.0% |
| cambridge | 30 | 4.2% |
| hill | 26 | 3.6% |
| savin | 23 | 3.2% |
| roxbury | 23 | 3.2% |
| lynn | 22 | 3.1% |
| newton | 18 | 2.5% |
| somerville | 15 | 2.1% |
| south | 13 | 1.8% |
| quincy | 12 | 1.7% |
| Other values (87) | 375 | 52.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 618 | 12.2% |
| n | 465 | 9.2% |
| t | 389 | 7.7% |
| e | 378 | 7.5% |
| a | 270 | 5.3% |
| r | 264 | 5.2% |
| s | 254 | 5.0% |
| l | 250 | 5.0% |
| B | 220 | 4.4% |
| i | 219 | 4.3% |
| Other values (31) | 1720 | 34.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4109 | 81.4% |
| Uppercase Letter | 722 | 14.3% |
| Space Separator | 208 | 4.1% |
| Dash Punctuation | 8 | 0.2% |
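The category names in this table correspond to Unicode general categories. The sketch below shows how such a tally could be produced with Python's `unicodedata` module; it uses a handful of town names that appear in this report rather than the full column, so the counts will not match the table.

```python
import unicodedata
from collections import Counter

# Town names taken from the sample and common-value tables of this report.
towns = ["Nahant", "Swampscott", "Marblehead", "Boston Savin Hill"]

# unicodedata.category returns two-letter codes: "Ll" = Lowercase Letter,
# "Lu" = Uppercase Letter, "Zs" = Space Separator, "Pd" = Dash Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for name in towns for ch in name
)
print(category_counts.most_common())
```

Whether the profiler uses exactly this mapping internally is not stated in the report; the example only illustrates what the category labels mean.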
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 618 | 15.0% |
| n | 465 | 11.3% |
| t | 389 | 9.5% |
| e | 378 | 9.2% |
| a | 270 | 6.6% |
| r | 264 | 6.4% |
| s | 254 | 6.2% |
| l | 250 | 6.1% |
| i | 219 | 5.3% |
| d | 134 | 3.3% |
| Other values (13) | 868 | 21.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 220 | 30.5% |
| S | 75 | 10.4% |
| W | 65 | 9.0% |
| C | 48 | 6.6% |
| H | 44 | 6.1% |
| M | 43 | 6.0% |
| R | 42 | 5.8% |
| N | 41 | 5.7% |
| L | 31 | 4.3% |
| D | 30 | 4.2% |
| Other values (6) | 83 | 11.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 208 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4831 | 95.7% |
| Common | 216 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 618 | 12.8% |
| n | 465 | 9.6% |
| t | 389 | 8.1% |
| e | 378 | 7.8% |
| a | 270 | 5.6% |
| r | 264 | 5.5% |
| s | 254 | 5.3% |
| l | 250 | 5.2% |
| B | 220 | 4.6% |
| i | 219 | 4.5% |
| Other values (29) | 1504 | 31.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 208 | 96.3% |
| - | 8 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5047 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 618 | 12.2% |
| n | 465 | 9.2% |
| t | 389 | 7.7% |
| e | 378 | 7.5% |
| a | 270 | 5.3% |
| r | 264 | 5.2% |
| s | 254 | 5.0% |
| l | 250 | 5.0% |
| B | 220 | 4.4% |
| i | 219 | 4.3% |
| Other values (31) | 1720 | 34.1% |
TRACT
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 506 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2700.3557 |
| Minimum | 1 |
|---|---|
| Maximum | 5082 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 430.5 |
| Q1 | 1303.25 |
| median | 3393.5 |
| Q3 | 3739.75 |
| 95-th percentile | 4202.75 |
| Maximum | 5082 |
| Range | 5081 |
| Interquartile range (IQR) | 2436.5 |
Descriptive statistics
| Standard deviation | 1380.0368 |
|---|---|
| Coefficient of variation (CV) | 0.51105742 |
| Kurtosis | -1.1960953 |
| Mean | 2700.3557 |
| Median Absolute Deviation (MAD) | 787 |
| Skewness | -0.43580814 |
| Sum | 1366380 |
| Variance | 1904501.7 |
| Monotonicity | Not monotonic |
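Several of the TRACT figures are derived from others in the same tables. The following sketch re-derives them from the rounded values printed above, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Figures copied from the TRACT tables.
mean, std = 2700.3557, 1380.0368
q1, q3 = 1303.25, 3739.75
minimum, maximum = 1, 5082

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range

# Compare with the reported CV, variance, IQR and range.
print(round(cv, 6), round(variance, 1), iqr, value_range)
```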
Common values
| Value | Count | Frequency (%) |
| 2011 | 1 | 0.2% |
| 4212 | 1 | 0.2% |
| 5021 | 1 | 0.2% |
| 5012 | 1 | 0.2% |
| 5011 | 1 | 0.2% |
| 5001 | 1 | 0.2% |
| 4231 | 1 | 0.2% |
| 4228 | 1 | 0.2% |
| 4227 | 1 | 0.2% |
| 4226 | 1 | 0.2% |
| Other values (496) | 496 | 98.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 1 | 0.2% |
| 2 | 1 | 0.2% |
| 3 | 1 | 0.2% |
| 4 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 7 | 1 | 0.2% |
| 8 | 1 | 0.2% |
| 101 | 1 | 0.2% |
| 102 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 5082 | 1 | 0.2% |
| 5081 | 1 | 0.2% |
| 5071 | 1 | 0.2% |
| 5062 | 1 | 0.2% |
| 5061 | 1 | 0.2% |
| 5052 | 1 | 0.2% |
| 5051 | 1 | 0.2% |
| 5041 | 1 | 0.2% |
| 5031 | 1 | 0.2% |
| 5022 | 1 | 0.2% |
LON
Real number (ℝ)
| Distinct | 375 |
|---|---|
| Distinct (%) | 74.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -71.056389 |
| Minimum | -71.2895 |
|---|---|
| Maximum | -70.81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 506 |
| Negative (%) | 100.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | -71.2895 |
|---|---|
| 5-th percentile | -71.202375 |
| Q1 | -71.093225 |
| median | -71.0529 |
| Q3 | -71.019625 |
| 95-th percentile | -70.936 |
| Maximum | -70.81 |
| Range | 0.4795 |
| Interquartile range (IQR) | 0.0736 |
Descriptive statistics
| Standard deviation | 0.075405348 |
|---|---|
| Coefficient of variation (CV) | -0.0010612043 |
| Kurtosis | 1.1084808 |
| Mean | -71.056389 |
| Median Absolute Deviation (MAD) | 0.0371 |
| Skewness | -0.20538473 |
| Sum | -35954.533 |
| Variance | 0.0056859665 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| -71.069 | 5 | 1.0% |
| -71.03 | 4 | 0.8% |
| -71.02 | 4 | 0.8% |
| -71.0455 | 4 | 0.8% |
| -71.055 | 4 | 0.8% |
| -71.059 | 4 | 0.8% |
| -71.04 | 4 | 0.8% |
| -71.075 | 4 | 0.8% |
| -71.11 | 3 | 0.6% |
| -71.09 | 3 | 0.6% |
| Other values (365) | 467 | 92.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| -71.2895 | 1 | 0.2% |
| -71.2807 | 1 | 0.2% |
| -71.269 | 1 | 0.2% |
| -71.2685 | 1 | 0.2% |
| -71.263 | 1 | 0.2% |
| -71.262 | 1 | 0.2% |
| -71.2575 | 1 | 0.2% |
| -71.255 | 1 | 0.2% |
| -71.2475 | 1 | 0.2% |
| -71.247 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| -70.81 | 1 | 0.2% |
| -70.83 | 2 | 0.4% |
| -70.833 | 1 | 0.2% |
| -70.8525 | 1 | 0.2% |
| -70.853 | 1 | 0.2% |
| -70.855 | 1 | 0.2% |
| -70.86 | 1 | 0.2% |
| -70.8875 | 1 | 0.2% |
| -70.9075 | 1 | 0.2% |
| -70.915 | 1 | 0.2% |
LAT
Real number (ℝ)
| Distinct | 376 |
|---|---|
| Distinct (%) | 74.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.21644 |
| Minimum | 42.03 |
|---|---|
| Maximum | 42.381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 42.03 |
|---|---|
| 5-th percentile | 42.10745 |
| Q1 | 42.180775 |
| median | 42.2181 |
| Q3 | 42.25225 |
| 95-th percentile | 42.31985 |
| Maximum | 42.381 |
| Range | 0.351 |
| Interquartile range (IQR) | 0.071475 |
Descriptive statistics
| Standard deviation | 0.061777184 |
|---|---|
| Coefficient of variation (CV) | 0.0014633442 |
| Kurtosis | 0.10400249 |
| Mean | 42.21644 |
| Median Absolute Deviation (MAD) | 0.03625 |
| Skewness | -0.086678598 |
| Sum | 21361.519 |
| Variance | 0.0038164205 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 42.23 | 5 | 1.0% |
| 42.192 | 4 | 0.8% |
| 42.245 | 4 | 0.8% |
| 42.188 | 4 | 0.8% |
| 42.2075 | 4 | 0.8% |
| 42.255 | 3 | 0.6% |
| 42.225 | 3 | 0.6% |
| 42.2875 | 3 | 0.6% |
| 42.169 | 3 | 0.6% |
| 42.305 | 3 | 0.6% |
| Other values (366) | 470 | 92.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 42.03 | 1 | 0.2% |
| 42.0485 | 1 | 0.2% |
| 42.052 | 1 | 0.2% |
| 42.059 | 2 | 0.4% |
| 42.0675 | 1 | 0.2% |
| 42.0725 | 2 | 0.4% |
| 42.0735 | 1 | 0.2% |
| 42.0775 | 2 | 0.4% |
| 42.0795 | 1 | 0.2% |
| 42.0825 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 42.381 | 1 | 0.2% |
| 42.374 | 1 | 0.2% |
| 42.3715 | 2 | 0.4% |
| 42.3525 | 1 | 0.2% |
| 42.346 | 2 | 0.4% |
| 42.345 | 2 | 0.4% |
| 42.3425 | 1 | 0.2% |
| 42.34 | 1 | 0.2% |
| 42.339 | 1 | 0.2% |
| 42.3382 | 1 | 0.2% |
MEDV
Real number (ℝ)
| Distinct | 228 |
|---|---|
| Distinct (%) | 45.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.528854 |
| Minimum | 5 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 10.2 |
| Q1 | 17.025 |
| median | 21.2 |
| Q3 | 25 |
| 95-th percentile | 43.4 |
| Maximum | 50 |
| Range | 45 |
| Interquartile range (IQR) | 7.975 |
Descriptive statistics
| Standard deviation | 9.1821759 |
|---|---|
| Coefficient of variation (CV) | 0.40757404 |
| Kurtosis | 1.5167834 |
| Mean | 22.528854 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.1109119 |
| Sum | 11399.6 |
| Variance | 84.312354 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 25 | 8 | 1.6% |
| 23.1 | 7 | 1.4% |
| 21.7 | 7 | 1.4% |
| 19.4 | 6 | 1.2% |
| 20.6 | 6 | 1.2% |
| 22 | 6 | 1.2% |
| 21.4 | 5 | 1.0% |
| 21.2 | 5 | 1.0% |
| 19.3 | 5 | 1.0% |
| Other values (218) | 435 | 86.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 5 | 2 | 0.4% |
| 5.6 | 1 | 0.2% |
| 6.3 | 1 | 0.2% |
| 7 | 2 | 0.4% |
| 7.2 | 3 | 0.6% |
| 7.4 | 1 | 0.2% |
| 7.5 | 1 | 0.2% |
| 8.1 | 1 | 0.2% |
| 8.2 | 1 | 0.2% |
| 8.3 | 2 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 48.8 | 1 | 0.2% |
| 48.5 | 1 | 0.2% |
| 48.3 | 1 | 0.2% |
| 46.7 | 1 | 0.2% |
| 46 | 1 | 0.2% |
| 45.4 | 1 | 0.2% |
| 44.8 | 1 | 0.2% |
| 44 | 1 | 0.2% |
| 43.8 | 1 | 0.2% |
CRIM
Real number (ℝ)
| Distinct | 504 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6135236 |
| Minimum | 0.00632 |
|---|---|
| Maximum | 88.9762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.00632 |
|---|---|
| 5-th percentile | 0.02791 |
| Q1 | 0.082045 |
| median | 0.25651 |
| Q3 | 3.6770825 |
| 95-th percentile | 15.78915 |
| Maximum | 88.9762 |
| Range | 88.96988 |
| Interquartile range (IQR) | 3.5950375 |
Descriptive statistics
| Standard deviation | 8.6015451 |
|---|---|
| Coefficient of variation (CV) | 2.3803761 |
| Kurtosis | 37.130509 |
| Mean | 3.6135236 |
| Median Absolute Deviation (MAD) | 0.22145 |
| Skewness | 5.2231488 |
| Sum | 1828.4429 |
| Variance | 73.986578 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 0.01501 | 2 | 0.4% |
| 14.3337 | 2 | 0.4% |
| 0.03466 | 1 | 0.2% |
| 0.03113 | 1 | 0.2% |
| 0.03049 | 1 | 0.2% |
| 0.02543 | 1 | 0.2% |
| 0.02498 | 1 | 0.2% |
| 0.01301 | 1 | 0.2% |
| 0.06151 | 1 | 0.2% |
| 0.05497 | 1 | 0.2% |
| Other values (494) | 494 | 97.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.00632 | 1 | 0.2% |
| 0.00906 | 1 | 0.2% |
| 0.01096 | 1 | 0.2% |
| 0.01301 | 1 | 0.2% |
| 0.01311 | 1 | 0.2% |
| 0.0136 | 1 | 0.2% |
| 0.01381 | 1 | 0.2% |
| 0.01432 | 1 | 0.2% |
| 0.01439 | 1 | 0.2% |
| 0.01501 | 2 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 88.9762 | 1 | 0.2% |
| 73.5341 | 1 | 0.2% |
| 67.9208 | 1 | 0.2% |
| 51.1358 | 1 | 0.2% |
| 45.7461 | 1 | 0.2% |
| 41.5292 | 1 | 0.2% |
| 38.3518 | 1 | 0.2% |
| 37.6619 | 1 | 0.2% |
| 28.6558 | 1 | 0.2% |
| 25.9406 | 1 | 0.2% |
ZN
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 26 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.363636 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 372 |
| Zeros (%) | 73.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12.5 |
| 95-th percentile | 80 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 12.5 |
Descriptive statistics
| Standard deviation | 23.322453 |
|---|---|
| Coefficient of variation (CV) | 2.0523759 |
| Kurtosis | 4.0315101 |
| Mean | 11.363636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.2256663 |
| Sum | 5750 |
| Variance | 543.93681 |
| Monotonicity | Not monotonic |
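The zero median and zero MAD for ZN follow directly from the fact that 372 of 506 values are zero. The sketch below illustrates this with a stand-in column: the non-zero entries are replaced by a placeholder value of 1.0 (an assumption for illustration, not the real data), since any positive values give the same result.

```python
import statistics

n_zero = 372               # zeros reported for ZN
n_nonzero = 506 - n_zero   # remaining rows with a positive value

# Placeholder column: real non-zero values differ, but zeros are the
# majority, so the median does not depend on them.
zn = [0.0] * n_zero + [1.0] * n_nonzero

median = statistics.median(zn)
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(x - median) for x in zn)
print(median, mad)
```

Quartiles above the median, such as Q3, do depend on the real non-zero values and cannot be reproduced from this stand-in.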
Common values
| Value | Count | Frequency (%) |
| 0 | 372 | 73.5% |
| 20 | 21 | 4.2% |
| 80 | 15 | 3.0% |
| 22 | 10 | 2.0% |
| 12.5 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 40 | 7 | 1.4% |
| 45 | 6 | 1.2% |
| 30 | 6 | 1.2% |
| 90 | 5 | 1.0% |
| Other values (16) | 44 | 8.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 372 | 73.5% |
| 12.5 | 10 | 2.0% |
| 17.5 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 20 | 21 | 4.2% |
| 21 | 4 | 0.8% |
| 22 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 28 | 3 | 0.6% |
| 30 | 6 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 100 | 1 | 0.2% |
| 95 | 4 | 0.8% |
| 90 | 5 | 1.0% |
| 85 | 2 | 0.4% |
| 82.5 | 2 | 0.4% |
| 80 | 15 | 3.0% |
| 75 | 3 | 0.6% |
| 70 | 3 | 0.6% |
| 60 | 4 | 0.8% |
| 55 | 3 | 0.6% |
INDUS
Real number (ℝ)
| Distinct | 76 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.136779 |
| Minimum | 0.46 |
|---|---|
| Maximum | 27.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 2.18 |
| Q1 | 5.19 |
| median | 9.69 |
| Q3 | 18.1 |
| 95-th percentile | 21.89 |
| Maximum | 27.74 |
| Range | 27.28 |
| Interquartile range (IQR) | 12.91 |
Descriptive statistics
| Standard deviation | 6.8603529 |
|---|---|
| Coefficient of variation (CV) | 0.61600874 |
| Kurtosis | -1.2335396 |
| Mean | 11.136779 |
| Median Absolute Deviation (MAD) | 6.32 |
| Skewness | 0.29502157 |
| Sum | 5635.21 |
| Variance | 47.064442 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 18.1 | 132 | 26.1% |
| 19.58 | 30 | 5.9% |
| 8.14 | 22 | 4.3% |
| 6.2 | 18 | 3.6% |
| 21.89 | 15 | 3.0% |
| 3.97 | 12 | 2.4% |
| 9.9 | 12 | 2.4% |
| 8.56 | 11 | 2.2% |
| 10.59 | 11 | 2.2% |
| 5.86 | 10 | 2.0% |
| Other values (66) | 233 | 46.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.46 | 1 | 0.2% |
| 0.74 | 1 | 0.2% |
| 1.21 | 1 | 0.2% |
| 1.22 | 1 | 0.2% |
| 1.25 | 2 | 0.4% |
| 1.32 | 1 | 0.2% |
| 1.38 | 1 | 0.2% |
| 1.47 | 2 | 0.4% |
| 1.52 | 4 | 0.8% |
| 1.69 | 2 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 27.74 | 5 | 1.0% |
| 25.65 | 7 | 1.4% |
| 21.89 | 15 | 3.0% |
| 19.58 | 30 | 5.9% |
| 18.1 | 132 | 26.1% |
| 15.04 | 3 | 0.6% |
| 13.92 | 5 | 1.0% |
| 13.89 | 4 | 0.8% |
| 12.83 | 6 | 1.2% |
| 11.93 | 5 | 1.0% |
CHAS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| 0 | 471 |
|---|---|
| 1 | 35 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 506 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
Words
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 506 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 506 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 506 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 471 | 93.1% |
| 1 | 35 | 6.9% |
NOX
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.55469506 |
| Minimum | 0.385 |
|---|---|
| Maximum | 0.871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.385 |
|---|---|
| 5-th percentile | 0.40925 |
| Q1 | 0.449 |
| median | 0.538 |
| Q3 | 0.624 |
| 95-th percentile | 0.74 |
| Maximum | 0.871 |
| Range | 0.486 |
| Interquartile range (IQR) | 0.175 |
Descriptive statistics
| Standard deviation | 0.11587768 |
|---|---|
| Coefficient of variation (CV) | 0.20890339 |
| Kurtosis | -0.064667133 |
| Mean | 0.55469506 |
| Median Absolute Deviation (MAD) | 0.0875 |
| Skewness | 0.72930792 |
| Sum | 280.6757 |
| Variance | 0.013427636 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 0.538 | 23 | 4.5% |
| 0.713 | 18 | 3.6% |
| 0.437 | 17 | 3.4% |
| 0.871 | 16 | 3.2% |
| 0.624 | 15 | 3.0% |
| 0.489 | 15 | 3.0% |
| 0.693 | 14 | 2.8% |
| 0.605 | 14 | 2.8% |
| 0.74 | 13 | 2.6% |
| 0.544 | 12 | 2.4% |
| Other values (71) | 349 | 69.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.385 | 1 | 0.2% |
| 0.389 | 1 | 0.2% |
| 0.392 | 2 | 0.4% |
| 0.394 | 1 | 0.2% |
| 0.398 | 2 | 0.4% |
| 0.4 | 4 | 0.8% |
| 0.401 | 3 | 0.6% |
| 0.403 | 3 | 0.6% |
| 0.404 | 3 | 0.6% |
| 0.405 | 3 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 0.871 | 16 | 3.2% |
| 0.77 | 8 | 1.6% |
| 0.74 | 13 | 2.6% |
| 0.718 | 6 | 1.2% |
| 0.713 | 18 | 3.6% |
| 0.7 | 11 | 2.2% |
| 0.693 | 14 | 2.8% |
| 0.679 | 8 | 1.6% |
| 0.671 | 7 | 1.4% |
| 0.668 | 3 | 0.6% |
RM
Real number (ℝ)
| Distinct | 446 |
|---|---|
| Distinct (%) | 88.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2846344 |
| Minimum | 3.561 |
|---|---|
| Maximum | 8.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 3.561 |
|---|---|
| 5-th percentile | 5.314 |
| Q1 | 5.8855 |
| median | 6.2085 |
| Q3 | 6.6235 |
| 95-th percentile | 7.5875 |
| Maximum | 8.78 |
| Range | 5.219 |
| Interquartile range (IQR) | 0.738 |
Descriptive statistics
| Standard deviation | 0.70261714 |
|---|---|
| Coefficient of variation (CV) | 0.11179921 |
| Kurtosis | 1.8915004 |
| Mean | 6.2846344 |
| Median Absolute Deviation (MAD) | 0.3455 |
| Skewness | 0.40361213 |
| Sum | 3180.025 |
| Variance | 0.49367085 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 5.713 | 3 | 0.6% |
| 6.167 | 3 | 0.6% |
| 6.127 | 3 | 0.6% |
| 6.229 | 3 | 0.6% |
| 6.405 | 3 | 0.6% |
| 6.417 | 3 | 0.6% |
| 6.782 | 2 | 0.4% |
| 6.951 | 2 | 0.4% |
| 6.63 | 2 | 0.4% |
| 6.312 | 2 | 0.4% |
| Other values (436) | 480 | 94.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 3.561 | 1 | 0.2% |
| 3.863 | 1 | 0.2% |
| 4.138 | 2 | 0.4% |
| 4.368 | 1 | 0.2% |
| 4.519 | 1 | 0.2% |
| 4.628 | 1 | 0.2% |
| 4.652 | 1 | 0.2% |
| 4.88 | 1 | 0.2% |
| 4.903 | 1 | 0.2% |
| 4.906 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 8.78 | 1 | 0.2% |
| 8.725 | 1 | 0.2% |
| 8.704 | 1 | 0.2% |
| 8.398 | 1 | 0.2% |
| 8.375 | 1 | 0.2% |
| 8.337 | 1 | 0.2% |
| 8.297 | 1 | 0.2% |
| 8.266 | 1 | 0.2% |
| 8.259 | 1 | 0.2% |
| 8.247 | 1 | 0.2% |
AGE
Real number (ℝ)
| Distinct | 356 |
|---|---|
| Distinct (%) | 70.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.574901 |
| Minimum | 2.9 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 2.9 |
|---|---|
| 5-th percentile | 17.725 |
| Q1 | 45.025 |
| median | 77.5 |
| Q3 | 94.075 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 97.1 |
| Interquartile range (IQR) | 49.05 |
Descriptive statistics
| Standard deviation | 28.148861 |
|---|---|
| Coefficient of variation (CV) | 0.41048344 |
| Kurtosis | -0.96771559 |
| Mean | 68.574901 |
| Median Absolute Deviation (MAD) | 19.55 |
| Skewness | -0.59896264 |
| Sum | 34698.9 |
| Variance | 792.3584 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 100 | 43 | 8.5% |
| 95.4 | 4 | 0.8% |
| 96 | 4 | 0.8% |
| 98.2 | 4 | 0.8% |
| 97.9 | 4 | 0.8% |
| 98.8 | 4 | 0.8% |
| 87.9 | 4 | 0.8% |
| 95.6 | 3 | 0.6% |
| 97 | 3 | 0.6% |
| 21.4 | 3 | 0.6% |
| Other values (346) | 430 | 85.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2.9 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 6.2 | 1 | 0.2% |
| 6.5 | 1 | 0.2% |
| 6.6 | 2 | 0.4% |
| 6.8 | 1 | 0.2% |
| 7.8 | 2 | 0.4% |
| 8.4 | 1 | 0.2% |
| 8.9 | 1 | 0.2% |
| 9.8 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 100 | 43 | 8.5% |
| 99.3 | 1 | 0.2% |
| 99.1 | 1 | 0.2% |
| 98.9 | 3 | 0.6% |
| 98.8 | 4 | 0.8% |
| 98.7 | 1 | 0.2% |
| 98.5 | 1 | 0.2% |
| 98.4 | 2 | 0.4% |
| 98.3 | 2 | 0.4% |
| 98.2 | 4 | 0.8% |
DIS
Real number (ℝ)
| Distinct | 412 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7950427 |
| Minimum | 1.1296 |
|---|---|
| Maximum | 12.1265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.1296 |
|---|---|
| 5-th percentile | 1.461975 |
| Q1 | 2.100175 |
| median | 3.20745 |
| Q3 | 5.188425 |
| 95-th percentile | 7.8278 |
| Maximum | 12.1265 |
| Range | 10.9969 |
| Interquartile range (IQR) | 3.08825 |
Descriptive statistics
| Standard deviation | 2.1057101 |
|---|---|
| Coefficient of variation (CV) | 0.55485809 |
| Kurtosis | 0.48794112 |
| Mean | 3.7950427 |
| Median Absolute Deviation (MAD) | 1.29115 |
| Skewness | 1.0117806 |
| Sum | 1920.2916 |
| Variance | 4.4340151 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 3.4952 | 5 | 1.0% |
| 5.7209 | 4 | 0.8% |
| 5.2873 | 4 | 0.8% |
| 6.8147 | 4 | 0.8% |
| 5.4007 | 4 | 0.8% |
| 6.3361 | 3 | 0.6% |
| 3.9454 | 3 | 0.6% |
| 6.498 | 3 | 0.6% |
| 4.7211 | 3 | 0.6% |
| 4.8122 | 3 | 0.6% |
| Other values (402) | 470 | 92.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1.1296 | 1 | 0.2% |
| 1.137 | 1 | 0.2% |
| 1.1691 | 1 | 0.2% |
| 1.1742 | 1 | 0.2% |
| 1.1781 | 1 | 0.2% |
| 1.2024 | 1 | 0.2% |
| 1.2852 | 1 | 0.2% |
| 1.3163 | 1 | 0.2% |
| 1.3216 | 1 | 0.2% |
| 1.3325 | 1 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 12.1265 | 1 | 0.2% |
| 10.7103 | 2 | 0.4% |
| 10.5857 | 2 | 0.4% |
| 9.2229 | 1 | 0.2% |
| 9.2203 | 2 | 0.4% |
| 9.1876 | 1 | 0.2% |
| 9.0892 | 1 | 0.2% |
| 8.9067 | 2 | 0.4% |
| 8.7921 | 2 | 0.4% |
| 8.6966 | 1 | 0.2% |
RAD
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.5494071 |
| Minimum | 1 |
|---|---|
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 24 |
| 95-th percentile | 24 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 8.7072594 |
|---|---|
| Coefficient of variation (CV) | 0.91181152 |
| Kurtosis | -0.86723199 |
| Mean | 9.5494071 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.0048146 |
| Sum | 4832 |
| Variance | 75.816366 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 24 | 132 | 26.1% |
| 5 | 115 | 22.7% |
| 4 | 110 | 21.7% |
| 3 | 38 | 7.5% |
| 6 | 26 | 5.1% |
| 2 | 24 | 4.7% |
| 8 | 24 | 4.7% |
| 1 | 20 | 4.0% |
| 7 | 17 | 3.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 20 | 4.0% |
| 2 | 24 | 4.7% |
| 3 | 38 | 7.5% |
| 4 | 110 | 21.7% |
| 5 | 115 | 22.7% |
| 6 | 26 | 5.1% |
| 7 | 17 | 3.4% |
| 8 | 24 | 4.7% |
| 24 | 132 | 26.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 24 | 132 | 26.1% |
| 8 | 24 | 4.7% |
| 7 | 17 | 3.4% |
| 6 | 26 | 5.1% |
| 5 | 115 | 22.7% |
| 4 | 110 | 21.7% |
| 3 | 38 | 7.5% |
| 2 | 24 | 4.7% |
| 1 | 20 | 4.0% |
TAX
Real number (ℝ)
| Distinct | 66 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 408.23715 |
| Minimum | 187 |
|---|---|
| Maximum | 711 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 187 |
|---|---|
| 5-th percentile | 222 |
| Q1 | 279 |
| median | 330 |
| Q3 | 666 |
| 95-th percentile | 666 |
| Maximum | 711 |
| Range | 524 |
| Interquartile range (IQR) | 387 |
Descriptive statistics
| Standard deviation | 168.53712 |
|---|---|
| Coefficient of variation (CV) | 0.4128412 |
| Kurtosis | -1.142408 |
| Mean | 408.23715 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.66995594 |
| Sum | 206568 |
| Variance | 28404.759 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 666 | 132 | 26.1% |
| 307 | 40 | 7.9% |
| 403 | 30 | 5.9% |
| 437 | 15 | 3.0% |
| 304 | 14 | 2.8% |
| 264 | 12 | 2.4% |
| 398 | 12 | 2.4% |
| 384 | 11 | 2.2% |
| 277 | 11 | 2.2% |
| 224 | 10 | 2.0% |
| Other values (56) | 219 | 43.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 187 | 1 | 0.2% |
| 188 | 7 | 1.4% |
| 193 | 8 | 1.6% |
| 198 | 1 | 0.2% |
| 216 | 5 | 1.0% |
| 222 | 7 | 1.4% |
| 223 | 5 | 1.0% |
| 224 | 10 | 2.0% |
| 226 | 1 | 0.2% |
| 233 | 9 | 1.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 711 | 5 | 1.0% |
| 666 | 132 | 26.1% |
| 469 | 1 | 0.2% |
| 437 | 15 | 3.0% |
| 432 | 9 | 1.8% |
| 430 | 3 | 0.6% |
| 422 | 1 | 0.2% |
| 411 | 2 | 0.4% |
| 403 | 30 | 5.9% |
| 402 | 2 | 0.4% |
PTRATIO
Real number (ℝ)
| Distinct | 46 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.455534 |
| Minimum | 12.6 |
|---|---|
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 12.6 |
|---|---|
| 5-th percentile | 14.7 |
| Q1 | 17.4 |
| median | 19.05 |
| Q3 | 20.2 |
| 95-th percentile | 21 |
| Maximum | 22 |
| Range | 9.4 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.1649455 |
|---|---|
| Coefficient of variation (CV) | 0.11730604 |
| Kurtosis | -0.28509138 |
| Mean | 18.455534 |
| Median Absolute Deviation (MAD) | 1.15 |
| Skewness | -0.80232493 |
| Sum | 9338.5 |
| Variance | 4.6869891 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 20.2 | 140 | 27.7% |
| 14.7 | 34 | 6.7% |
| 21 | 27 | 5.3% |
| 17.8 | 23 | 4.5% |
| 19.2 | 19 | 3.8% |
| 17.4 | 18 | 3.6% |
| 18.6 | 17 | 3.4% |
| 19.1 | 17 | 3.4% |
| 18.4 | 16 | 3.2% |
| 16.6 | 16 | 3.2% |
| Other values (36) | 179 | 35.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 12.6 | 3 | 0.6% |
| 13 | 12 | 2.4% |
| 13.6 | 1 | 0.2% |
| 14.4 | 1 | 0.2% |
| 14.7 | 34 | 6.7% |
| 14.8 | 3 | 0.6% |
| 14.9 | 4 | 0.8% |
| 15.1 | 1 | 0.2% |
| 15.2 | 13 | 2.6% |
| 15.3 | 3 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 22 | 2 | 0.4% |
| 21.2 | 15 | 3.0% |
| 21.1 | 1 | 0.2% |
| 21 | 27 | 5.3% |
| 20.9 | 11 | 2.2% |
| 20.2 | 140 | 27.7% |
| 20.1 | 5 | 1.0% |
| 19.7 | 8 | 1.6% |
| 19.6 | 8 | 1.6% |
| 19.2 | 19 | 3.8% |